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Abstract. The Frechet distance is a metric to compare two curves, 
which is based on monotonous matchings between these curves. We call a 
matching that results in the Frechet distance a Frechet matching. There 
are often many different Frechet matchings and not all of these capture 
the similarity between the curves well. We propose to restrict the set 
of Frechet matchings to "natural" matchings and to this end introduce 
locally correct Frechet matchings. We prove that at least one such match- 
ing exists for two polygonal curves and give an 0(N 3 \ogN) algorithm 
to compute it, where N is the total number of edges in both curves. We 
also present an 0(N 2 ) algorithm to compute a locally correct discrete 
Frechet matching. 

1 Introduction 

Many problems ask for the comparison of two curves. Consequently, several 
distance measures have been proposed for the similarity of two curves P and Q, 
for example, the Hausdorff and the Frechet distance. Such a distance measure 
simply returns a number indicating the (dis)similarity. However, the Hausdorff 
and the Frechet distance are both based on matchings of the points on the curves. 
The distance returned is the maximum distance between any two matched points. 
The Frechet distance uses monotonous matchings (and limits of these): if point 
p on P and q on Q are matched, then any point on P after p must be matched to 
q or a point on Q after q. The Frechet distance is the maximal distance between 
two matched points minimized over all monotonous matchings of the curves. 
Restricting to monotonous matchings of only the vertices results in the discrete 
Frechet distance. We call a matching resulting in the (discrete) Frechet distance 
a (discrete) Frechet matching. See Section [2] for more details. 

There are often many different Frechet matchings for two curves. However, 
as the Frechet distance is determined only by the maximal distance, not all 
of these matchings capture the similarity between the curves well (see Fig. [IJ. 
There are applications that directly use a matching, for example, to map a 
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Fig. 1. Two Frechet matchings for curves P and Q. 



GPS track to a street network [5] or to morph between the curves Q]. In such 
situations a "good" matching is important. We believe that many applications 
of the (discrete) Frechet distance, such as protein alignment [9] and detecting 
patterns in movement data [3J, would profit from good Frechet matchings. 

Results. We restrict the set of Frechet matchings to "natural" matchings by in- 
troducing locally correct Frechet matchings: matchings that for any two matched 
subcurves are again a Frechet matching on these subcurves. In Section[3]we prove 
that there exists such a locally correct Frechet matching for any two polygonal 
curves. Based on this proof we describe in Section[4]an 0(N 3 logiV) algorithm to 
compute such a matching, where N is the total number of edges in both curves. 
We consider the discrete Frechet distance in Section [5] and give an 0(N 2 ) algo- 
rithm to compute locally correct matchings under this metric. 

Related work. The first algorithm to compute the Frechet distance was given 
by Alt and Godau pQ. They also consider a non-monotone Frechet distance 
and their algorithm for this variant results in a locally correct non-monotone 
matching (see Remark 3.5 in [6]). Eiter and Mannila gave the first algorithm 
to compute the discrete Frechet distance [5]. Since then, the Frechet distance 
has received significant attention. Here we focus on approaches that restrict the 
allowed matchings. Efrat et al. [I] introduced Frechet-like metrics, the geodesic 
width and link width, to restrict to matchings suitable for curve morphing. Their 
method is suitable only for non-intersecting polylines. Moreover, geodesic width 
and link width do not resolve the problem illustrated in Fig. [T] both matchings 
also have minimal geodesic width and minimal link width. Maheshwari et al. [7] 
studied a restriction by "speed limits" , which may exclude all Frechet matchings 
and may cause undesirable effects near "outliers" (see Fig. [2]) . Buchin et al. [2] 
describe a framework for restricting Frechet matchings, which they illustrate by 
restricting slope and path length. The former corresponds to speed limits. We 
briefly discuss the latter at the end of Section |4| 



Ql 

Fig. 2. Two Frechet matchings. Right: the result of speed limits is not locally correct. 
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2 Preliminaries 



Curves. Let P be a polygonal curve with m edges, denned by vertices po, . . . , p m . 
We treat a curve as a continuous map P : [0, m] — >• M. d . In this map, P(i) equals 
Pi for integer i. Furthermore, P(i + X) is a parameterization of the (i + l)st edge, 
that is, P(i + A) = (1 — A) • Pi + A • p»+i, for integer i and < A < 1. As a 
reparametrization a : [0, 1] — > [0, m] of a curve P, we allow any continuous, non- 
decreasing function such that cr(0) = and c(l) = m. We denote by P a {t) the 
actual location according to reparametrization a: P a {t) = P(a(t)). By P a \a, b] 
we denote the subcurve of P in between P a (a) and Po-(6). In the following we are 
always given two polygonal curves P and Q, where Q is defined by its vertices 
Qo, . . . , q n and is reparametrized by 9 : [0, 1] — ¥ [0, n]. The reparametrized curve 
is denoted by Q@. 

Frechet matchings. We are given two polygonal curves P and Q with m and n 
edges. A (monotonous) matching fi between P and Q is a pair of reparametriza- 
tions (a, 6), such that P a {t) matches to Qg(t). The Euclidean distance between 
two matched points is denoted by d M (t) = \P a {t) — Qe{t)\. The maximum dis- 
tance over a range is denoted by d M [a, b] = max a < f <(, d^(t). The Frechet distance 
between two curves is defined as §f(P, Q) = inf M c? M [0, 1]. A Frechet matching is 
a matching fi that realizes the Frechet distance: ^[0, 1] = Sp(P, Q) holds. 

Free space diagrams. Alt and Godau p] describe an algorithm to compute the 
Frechet distance based on the decision variant (that is, solving 5p(P,Q) < e 
for some given e). Their algorithm uses a free space diagram, a two-dimensional 
diagram on the range [0, m] x [0, n]. Every point (x,y) in this diagram is either 
"free" (white) or not (indicating whether \P(x) — Q(y)\ < £)■ The diagram has 
m columns and n rows; every cell (c, r) (1 < c < m and 1 < r < n) corresponds 
to the edges p c -\p c and q r -\q r . To compute the Frechet distance, one finds the 
smallest e such that there exists an x- and y-monotone path from point (0, 0) to 
(m, n) in free space. For this, only certain critical values for the distance have 
to be checked. Imagine continuously increasing the distance e starting at e = 0. 
At so-called critical events, which are illustrated in Fig. [3j passages open in the 
free space. The critical values are the distances corresponding to these events. 




Fig. 3. Three event types. (A) Endpoints come within range of each other. (B) Passage 
opens on cell boundary. (C) Passage opens in row (or column). We scale every row and 
column in the diagram to correspond to the (relative) length of the actual edge of the 
curve instead of using unit squares for cells. 
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3 Locally correct Frechet matchings 



We introduce locally correct Frechet matchings, for which the matching between 
any two matched subcurves is a Frechet matching. 

Definition 1 (Local correctness). Given two polygonal curves P and Q, a 
matching /i = (a, 9) is locally correct if for all a, b with < a < b < 1 

d^[a,b] = 5p(P a [a,b],Qe[a,b]). 

Note that not every Frechet matching is locally correct. See for example Fig. [2} 
The question arises whether a locally correct matching always exists and if so, 
how to compute it. We resolve the first question in the following theorem. 

Theorem 1. For any two polygonal curves P and Q, there exists a locally cor- 
rect Frechet matching. 

Existence. We prove Theorem [T] by induction on the number of edges in the 
curves. First, we present the lemmata for the two base cases: one of the two 
curves is a point, and both curves are line segments. In the following, n and m 
again denote the number of edges of P and Q, respectively. 

Lemma 1. For two polygonal curves P and Q with m — 0, a locally correct 
matching is (a, 9), where a(t) = and 9(t) — t ■ n. 

Proof. Since m — 0, P is just a single point, pg. The Frechet distance between a 
point and a curve is the maximal distance between the point and any point on 
the curve: 5p(p Q ^ Qe[a, b]) = d^[a, b]. This implies that the matching \i is locally 
correct. □ 

Lemma 2. For two polygonal curves P and Q with m = n = 1, a locally correct 
matching is (o~,9), where a(t) = 9(t) = t. 

Proof. The free space diagram of P and Q is a single cell and thus the free 
space is a convex area for any value of e. Since \x = (a, 9) is linear, we have that 
d M [a, b] = max{d (1 (a), d M (6)}: if there would be a t with a < t < b such that 
dft(t) > max {dfj, (a), ^(6)}, then the free space at e = max {^(a), would 
not be convex. Since d^a^b] ~ max{d M (a), < SF(P a [a, b], Qe[a, b}) < 

d^a, b], we conclude that /x is locally correct. □ 

For induction, we split the two curves based on events (see Fig. Q). Since 
each split must reduce the problem size, we ignore any events on the left or 
bottom boundary of cell (1,1) or on the right or top boundary of cell (m,n). 
This excludes both events of type A. A free space diagram is connected at value 
e, if a monotonous path exists from the boundary of cell (1, 1) to the boundary 
of cell (m,n). A realizing event is a critical event at the minimal value e such 
that the corresponding free space diagram is connected. 

Let £ denote the set of concurrent realizing events for two curves. A realizing 
set E r is a subset of £ such that the free space admits a monotonous path from 
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Fig. 4. (a) Curves with the free space diagram for e = <5f(P, Q) and the realizing event, 
(b) The event splits each curve into two subcurves. The hatched areas indicate parts 
that disappear after the split. 

cell (1, 1) to cell (m, n) without using an event in £\E T . Note that a realizing set 
cannot be empty. When £ contains more than one realizing event, some may be 
"insignificant" : they are never required to actually make a path in the free space 
diagram. A realizing set is minimal if it does not contain a strict subset that is 
a realizing set. Such a minimal realizing set contains only "significant" events. 

Lemma 3. For two polygonal curves P and Q with m > 1 and n > 1, there 
exists a minimal realizing set. 

Proof. Let £ denote the non-empty set of concurrent events at the minimal 
critical value. By definition, the empty set cannot be a realizing set and £ is a 
realizing set. Hence, £ contains a minimal realizing set. □ 

The following lemma directly implies that a locally correct Frechet matching 
always exists. Informally, it states that curves have a locally correct matching 
that is "closer" (except in cell (1, 1) or (to, n)) than the distance of their realizing 
set. Further, this matching is linear inside every cell. In the remainder, we use 
realizing set to indicate a minimal realizing set, unless indicated otherwise. 

Lemma 4. // the free space diagram of two polygonal curves P and Q is con- 
nected at value e, then there exists a locally correct Frechet matching /i = (a, 6) 
such that dfj,(t) < e for all t with o~(t) > 1 or 0{t) > 1, and o~(t) < m — 1 or 
6(t) < n — 1. Furthermore, fi is linear in every cell. 

Proof. We prove this by induction on m + n. The base cases (m = 0, n = 0, and 
to = n = 1) follow from Lemma [I] and Lemma [2] 

For induction, we assume that m > 1, n > 1, and m + n > 2. By Lemma [3j 
a realizing set E T exists for P and Q, say at value e r . The set contains realizing 
events e%, . . . , (k > 1), numbered in lexicographic order. By definition, e r < 
e holds. Suppose that E r splits curve P into Pi, . . . , Pk+i and curve Q into 
Qi, . . . , Qk+i, where Pi has rm edges, Qi has ni edges. By definition of a realizing 
event, none of the events in E r occur on the right or top boundary of cell (to, n). 
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Hence, for any i (1 < i < k + 1), it holds that rrn < m, rn < n, and rrii < m 
or rii < n. Since a path exists in the free space diagram at e r through all events 
in E r , the induction hypothesis implies that, for any i (1 < i < k + 1), a locally 
correct matching fi t = {cn,9i) exists for Pi and Qi such that fii is linear in every 
cell and < e r for all t with ai{t) > 1 or > 1, and er^i) < mi — 1 

or < tii ~ 1. Combining these matchings with the events in E r yields a 

matching [i = (cr, 9) for (P, Q). 

As we argue below, this matching is locally correct and satisfies the additional 
properties. The matching of an event corresponds to a single point (type B) or 
a horizontal or vertical line (type C). By induction, fii is linear in every cell. 
Since all events occur on cell boundaries, the cells of the matchings and events 
are disjoint. Therefore, the matching fj, is also linear inside every cell. 

For i < k + 1, d lli is at most e r at the point where jii enters cell (m^n^) 
in the free space diagram of Pi and Qi- We also know that d fli equals e r at the 
top right corner of cell (m^rti). Since iii is linear inside the cell, d^^t) < s r also 
holds for t with tJi(t) > rrii — 1 and 6i(t) > n, — 1. Analogously, for i > 0, 
is at most e r for t with <Ji(t) < 1 and 6i(t) < 1. Hence, d^(t) < e r < e holds for 
t with a(t) > 1 or 6>(t) > 1, and a(t) < m - 1 or 0(t) < n - 1. 

To show that fi is locally correct, suppose for contradiction that values a, b 
exist such that Sf {P a [a, b] , Qg [a, 6] ) < rf M [a, 6] . If a, 6 are in between two consecu- 
tive events, we know that the submatching corresponds to one of the matchings 
jtij. Since these are locally correct, 5p(P (T [a, b], Qe[a, b]) — d^[a, b] must hold. 

Hence, suppose that a and b are separated by at least one event of E T . There 
are two possibilities: either d^a, 6] = e r or c? M [a, b] > e r . d^[a, b] < e r cannot 
hold, since d M [a, 6] includes a realizing event. First, assume d M [a, 6] = e r holds. 
If 5f(-P<t[0) <5e[a, &]) < holds, then a matching exists that does not use 
the events between a and b and has a lower maximum. Hence, the free space 
connects point {a (a), 0(a)) with point (<r(b), 9(b)) at a lower value than e r . This 
implies that all events between a and b can be omitted, contradicting that E T is 
a minimal realizing set. 

Now, assume d M [a, b] > e r . Let t' denote the highest t for which a(t) < 1 and 
9(t) < 1 holds, that is, the point at which the matching exits cell (1, 1). Similarly, 
let t" denote the lowest t for which a(t) > m — 1 and 6(t) > n — 1 holds. Since 
du(t) < e r holds for any t' < t < t" , dAt) > e r can hold only for t < t' or t > t" . 
Suppose that d^(a) > e r holds. Then a < t' holds and \i is linear between a 
and t'. Therefore, d^(a) > d^(t) holds for any t with a < t < t'. Analogously, 
if d^b) > e r holds, then d M (6) > holds for any t with t" < t < b . Hence, 

dfj\a,b\ = max{d M (a), must hold. 

This maximum is a lower bound on the Frechet distance, contradicting the 
assumption that d^ [a, b] is larger than the Frechet distance. Matching ii is there- 
fore locally correct. □ 
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4 Algorithm for locally correct Frechet matchings 

The existence proof directly results in a recursive algorithm, which is given by 
Algorithm [I] Fig. [I] (left), Fig. [| (left), Fig. [5} Fig. [6j and Fig. [7] (left) illustrate 
matchings computed with our algorithm. This section is devoted to proving the 
following theorem. 

Theorem 2. Algorithm [7] computes a locally correct Frechet matching of two 
polygonal curves P and Q with m and n edges in 0{(m + n)mn log(mn)) time. 



Algorithm 1 ComputeLCFM(P, Q) 
Require: P and Q are curves with m and n edges 
Ensure: A locally correct Frechet matching for P and Q 
1: if m = or n — then 

2: return (cr, 9) where a(t) — t ■ m, 8(t) = t ■ n 

3: else if m = n — 1 then 

4: return (cr, 6) where o(t) = 9(t) = t 

5: else 

6: Find event e r of a minimal realizing set 
7: Split P into Pi and P2 according to e r 
8: Split Q into Qi and Q2 according to e r 
9: m -> ComputeLCFM(Pi,Qi) 
10: ^2 ComputeLCFM(P 2 , Q 2 ) 
11: return concatenation of /xi, e r , and /12 



Using the notation of Alt and Godau [T], Lf^ denotes the interval of free 
space on the left boundary of cell Lfj denotes the subset of Lfj that is 

reachable from point (0, 0) of the free space diagram with a monotonous path in 
the free space. Analogously, Bf^ and are defined for the bottom boundary. 

With a slight modification to the decision algorithm, we can compute the 
minimal value of e such that a path is available from cell (1,1) to cell (m, n). 
This requires only two changes: Bf 2 should be initialized with B[ 2 and L 2 \ 
with L 21 ; the answer should be "yes" if and only if B^ n or L^ l n is non-empty. 
Realizing set. By computing the Frechet distance using the modified Alt and 
Godau algorithm, we obtain an ordered, potentially non-minimal realizing set 




Fig. 5. Locally correct matching produced by Algorithm [T] Free space diagram drawn 
at e = S F (P,Q). 



7 



£ = {ei , . . . , e;}. The algorithm must find an event that is contained in a realizing 
set. Let Ef. denote the first k events of £. For now we assume that the events 
in £ end at different cell boundaries. We use a binary search on £ to find the r 
such that E r contains a realizing set, but E r -\ docs not. This implies that event 
e r is contained in a realizing set and can be used to split the curves. Note that 
r is unique due to monotonicity. For correctness, the order of events in £ must 
be consistent in different iterations, for example, by using a lexicographic order. 
Set E r contains only realizing sets that use e r . Hence, £y_i contains a realizing 
set to connect cell (1,1) to e r and e r to cell (rn, n). Thus any event found in 
subsequent iterations is part of E r _\ and of a realizing set with e r . 

To determine whether some E^ contains a realizing set, we check whether 
cells (1, 1) and (m, n) are connected without "using" the events of £\Ek- To do 
this efficiently, we further modify the Alt and Godau algorithm. We require only 
a method to prevent events in £\Ek from being used. After Lf-j is computed, we 
check whether the event e (if any) that ends at the left boundary of cell («, j) is 
part of £\Ek and necessary to obtain Efy If this is the case, we replace Lf-j with 
an empty interval. Event e is necessary if and only if Lf j is a singleton. To obtain 
an algorithm that is numerically more stable, we introduce entry points. The 
entry point of the left boundary of cell is the maximal i' < i such that ■ 
is non-empty. These values are easily computed during the decision algorithm. 
Assume the passage corresponding to event e starts on the left boundary of cell 
(i s ,j). Event e is necessary to obtain L^- if and only if i' < i a . Therefore, we 
use the entry point instead of checking whether Lf^ is a singleton. This process 
is analogous for horizontal boundaries of cells. 

Earlier we assumed that each event in £ ends at a different cell boundary. If 
events end at the same boundary, then these occur in the same row (or column) 
and it suffices to consider only the event that starts at the rightmost column (or 
highest row). This justifies the assumption and ensures that £ contains O(mn) 
events. Thus computing e r (Algorithm [I] line [6]) takes 0{mn\og{mn)) time, 
which is equal to the time needed to compute the Frechet distance. Each re- 
cursion step splits the problem into two smaller problems, and the recursion 
ends when mn < 1. This results in an additional factor m + n. Thus the overall 
running time is 0((m + n)mn log(mn)). 

Sampling and further restrictions. Two curves may still have many locally 
correct Frechet matchings: the algorithm computes just one of these. However, 
introducing extra vertices may alter the result, even if these vertices do not 
modify the shape (see Fig. [6]) . This implies that the algorithm depends not only 



po O p', O p"o o O 

o □ Q c ' — □ Q o o Q 




Fig. 6. Different sampling may result in different matchings. 
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Fig. 7. Two locally correct Frechet matchings for P and Q. Right: shortest matching. 

on the shape of the curves, but also on the sampling. Increasing the sampling 
further and further seems to result in a matching that decreases the matched 
distance as much as possible within a cell. However, since cells are rectangles, 
there is a slight preference for taking longer diagonal paths. Based on this idea, 
we are currently investigating "locally optimal" Frechet matchings. The idea is 
to restrict to the locally correct Frechet matching that decreases the matched 
distance as quickly as possible. 

We also considered restricting to the "shortest" locally correct Frechet match- 
ing, where "short" refers to the length of the path in the free space diagram. 
However, Fig. [7] shows that such a restriction does not necessarily improve the 
quality of the matching. 

5 Locally correct discrete Frechet matchings 

Here we study the discrete variant of Frechet matchings. For the discrete Frechet 
distance, only the vertices of curves are matched. The discrete Frechet distance 
can be computed in 0(m ■ n) time via dynamic programming [5]. Here, we show 
how to also compute a locally correct discrete Frechet matching in 0(m-n) time. 

Grids. Since we are interested only in matching vertices of the curves, we can 
convert the problem to a grid problem. Suppose we have two curves P and Q 
with m and n edges respectively. These convert into a grid G of non-negative 
values with m + 1 columns and n + 1 rows. Every column corresponds to a 
vertex of P, every row to a vertex of Q. Any node of the grid G[i, j] corresponds 
to the pair of vertices (pi,qj)- Its value is the distance between the vertices: 
— \pi — Analogous to free space diagrams, we assume that G[0,0] is 
the bottomleft node and G[m, n] the topright node. 

Matchings. A monotonous path n is a sequence of grid nodes 7r(l), . . . , ir{k) such 
that every node tt(i) (1 < i < k) is the above, right, or above/right diagonal 
neighbor of Tr(i— 1). In the remainder of this section a path refers to a monotonous 
path unless indicated otherwise. A monotonous discrete matching of the curves 
corresponds to a path it such that 7r(l) = G[0,0] and 7r(fc) = G[m,n\. We 
call a path it locally correct if for all 1 < ti < t 2 < fc, max tl < 4 < t2 ir(t) = 
min w / maxi<«(;' ^(t), where n' ranges over all paths starting at tt'(I) = Tr(ti) 
and ending at Tr'(k') — irfa). 

Algorithm. The algorithm needs to compute a locally correct path between 
G[0,0] and G[m,n] in a grid G of non-negative values. To this end, the al- 



Algorithm 2 ComputeDiscreteLCFM(P, Q) 

Require: P and Q are curves with m and n edges 

Ensure: A locally correct discrete Frechet matching for P and Q 

1: Construct grid G for P and Q 

2: Let T be a tree consisting only of the root G[0, 0] 

3: for i <- 1 to m do 

4: AddG[i,0]toT 

5: for j f- 1 to n do 

6: AddG[0,j]toT 

7: for i <s— 1 to m do 

8: for j ' <— 1 to n do 

9: AddToTree(T, G,i,j) 

10: return path in T between G[0, 0] and G[m, n] 



gorithm incrementally constructs a tree T on the grid such that each path in 
T is locally correct. The algorithm is summarized by Algorithm [2j Wc define 
a growth node as a node of T that has a neighbor in the grid that is not yet 
part of T: a new branch may sprout from such a node. The growth nodes form 
a sequence of horizontally or vertically neighboring nodes. A living node is a 
node of T that is not a growth node but is an ancestor of a growth node. A 
dead node is a node of T that is neither a living nor a growth node, that is, it 
has no descendant that is a growth node. Every pair of nodes in this tree has a 
nearest common ancestor (NCA). When we have to decide what parent to use 
for a new node in the tree, we look at the maximum value on the path in the 
tree between the parents and their NCA (excluding the value of the latter). A 
face of the tree is the area enclosed by the segment between two horizontally or 
vertically neighboring growth nodes (without one being the parent of another) 
and the paths to their NCA. The unique sink of a face is the node of the grid 
that is in the lowest column and row of all nodes on the face. Fig. [8] (a-b) shows 
some examples of faces and their sinks. 

Shortcuts. To avoid repeatedly walking along the tree to compute maxima, 
we maintain up to two shortcuts from every node in the tree. The segment 
between the node and its parent is incident to up to two faces of the tree. The 



Fig. 8. (a) Face of tree (gray area) with its unique sink (solid dot). A dashed line 
represents a dead path, (b) Two adjacent faces with some shortcuts indicated, (c) Tree 
with 3 faces. Solid dots indicate growth nodes with a growth node as parent. These 
nodes are incident to at most one face. All shortcuts of these nodes are indicated. 
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Algorithm 3 AddToTree(T, G, i, j) 

Require: G is a grid of non-negative values; any path in tree T is locally correct 
Ensure: node G[i,j] is added to T and any path in T is locally correct 

1: parent(G[i, j]) «— candidate parent with lowest maximum value to NCA 

2: if G[i — 1, j — 1] is dead then 

3: Remove the dead path ending at G[i — 1, j — 1] and extend shortcuts 
4: Make shortcuts for G[i — G[i,j — 1], and G[i,j] where necessary 



node maintains shortcuts to the sink of these faces, associating the maximum 
value encountered on the path between the node and the sink (excluding the 
value of the sink). Fig. [8] (b) illustrates some shortcuts. With these shortcuts, 
it is possible to determine the maximum up to the NCA of two (potentially 
diagonally) neighboring growth nodes in constant time. 

Note that a node g of the tree that has a growth node as parent is incident 
to at most one face (see Fig. [8] (c)). We need the "other" shortcut only when 
the parent of g has a living parent. Therefore, the value of this shortcut can be 
obtained in constant time by using the shortcut of the parent. When the parent 
of g is no longer a growth node, then g obtains its own shortcut. 

Extending the tree. Algorithm [3] summarizes the steps required to extend the 
tree T with a new node. Node G[i, j] has three candidate parents, G[i — 1, j], 
G\i — l,j — 1], and G[i,j — 1]. Each pair of these candidates has an NCA. 
For the actual parent of G[i,j], we select the candidate c such that for any 
other candidate c', the maximum value from c to their NCA is at most the 
maximum value from d to their NCA — both excluding the NCA itself. We must 
be consistent when breaking ties between candidate parents. To this end, we use 
the preference order of G[i — >~ G[i — l,j — 1] >- G[i,j — 1]. Since paths 
in the tree cannot cross, this order is consistent between two paths at different 
stages of the algorithm. Note that a preference order that prefers G[i — 1, j — 1] 
over both other candidates or vice versa results in an incorrect algorithm. 

When a dead path is removed from the tree, adjacent faces merge and a sink 
may change. Hence, shortcuts have to be extended to point toward the new sink. 
Fig. [9] illustrates the incoming shortcuts at a sink and the effect of removing a 
dead path on the incoming shortcuts. Note that the algorithm does not need to 
remove dead paths that end in the highest row or rightmost column. 




Fig. 9. (a) Each sink has up to four sets of shortcuts, (b-d) Removing a dead path 
(dashed) extends at most one set of shortcuts. 
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Finally, G[i — 1, j], G[i, j — 1], and G[i,j] receive shortcuts where necessary. 
G[i — 1, j] or G[i,j — 1] needs a shortcut only if its parent is G[i — 1, j — 1] . G[z, 
needs two shortcuts if G[i — 1, j — 1] is its parent, only one shortcut otherwise. 

Correctness. To prove correctness of Algorithm |2j we require a stronger version 
of local correctness. A path tt is strongly locally correct if for all paths n' with 
the same endpoints maxi<t<fc n(t) < maxi <t /<fc/ n'(t') holds. Note that the first 
node is excluded from the maximum. Since maxi <t <fc w(t) < ruaxi<t'<fe' ^'(f) 
and 7r(l) = 7r'(l) imply maxi< t <fc n(t) < maxi<i/</.< 7r'(i'), a strongly locally 
correct path is also locally correct. Lemma [5] implies the correctness of Algo- 
rithm [2] 

Lemma 5. Algorithm [1] maintains the following invariant: any path in T is 
strongly locally correct. 

Proof. To prove this lemma, we strengthen the invariant. 

Invariant. We are given a tree T such that every path in T is strongly locally 
correct. In constructing T, any ties were broken using the preference order. 

Initialization. Tree T is initialized such that it contains two types of paths: 
either between grid nodes in the first column or in the first row. In both cases 
there is only one path between the endpoints of the path. Therefore, this path 
must be strongly locally correct. Since every node has only one candidate parent, 
T adheres to the preference order. 

Maintenance. The algorithm extends T to T' by including node g = G[i,j]. 
This is done by connecting g to one of its candidate parents (G[i — 1, j], G[i — 
l,j — 1], or G[i,j — 1]), the one that has the lowest maximum value along its 
path to the NCA. We must now prove that any path in T' is strongly locally 
correct. From the induction hypothesis, we conclude that only paths that end at 
g could falsify this statement. 

Suppose that such an invalidating path exists in T' , ending at g. This path 
must use one of the candidate parents of g as its before-last node. We distinguish 
three cases on how this path is situated compared to T". The last case, however, 
needs two subcases to deal with candidate parents that have the same maximum 
value on the path to their NCA. The four cases are illustrated in Fig. [lOj 

For each case, we consider the path 7Tj between the first vertex and the 
parent of g in the invalidating path (i.e. one of the three candidate parents). 
Note that 7Tj need not be disjoint of the paths in T'. Slightly abusing notation, 
we also use a path n' to denote its maximum value, excluding the first node, i.e. 
maxi<i<fc/ 7r'(t). We now show that for each case, the existence of the invalidating 
path contradicts the invariant on T. 

Case (a). Path 7Tj ends at the parent of g in T' . Path ir is the path in T' 
between the first and last vertex of 7Tj. Since {iri,g) is the invalidating path, 
we know that m&x{iri,g} < max{Tr,g} holds. This implies that 7Ti < it holds. 
In particular, this means that n, a path in T, is not strongly locally correct: a 
contradiction. 

Case (b). Path iri ends at a non-selected candidate parent of g. Path TT2 ends 
at the parent of g in T" and path 7T3 ends at the last vertex of TTj. Let nca denote 
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the NCA of the endpoints of 7r 2 and 773. The first vertex of 71"; is nca or one of 
its ancestors. Both path 7r 2 and 7r 3 start at nca. Let ~K\ be the path from 7Tj(l) 
to nca. Since the endpoint of 7r 2 was chosen as parent over the endpoint of 7r 3 , 
we know that 7r 2 < holds. Furthermore, since (ni,g) is the invalidating path, 
we know that max{7Ti,g} < max{7Ti, 7r 2 , g} holds. These two inequalities imply 
max{7Ti,(?} < max{7Ti, 7T3, g} holds. This in turn implies that i\\ < max{7Ti, 773} 
must hold. Since (771,713) is a path in T and the inequality implies that it is not 
strongly locally correct, we again have a contradiction. 

Case (c). Path tt\ ends at a non-selected candidate parent of g. Path 7r 2 ends 
at the parent of g in T' and path 7r 3 ends at the last vertex of -k\. Let nca denote 
the NCA of the endpoints of 7r 2 and 7r 3 . The first vertex of 7Ti is a descendant of 
nca. Path 7r 2 starts at 7Ti(l) and 773 starts at nca. Let tt\ be the path from nca 
to 7T;(1). In this case, we must explicitly consider the possibility of two paths 
having equal values. Hence, we distinguish two subcases. 

Case (c-1). In the first subcase, we assume that the endpoint of 7r 2 was cho- 
sen as parent since its maximum value is strictly lower: max{7Ti, 7r 2 } < 773 holds. 
Since (^i,g) is the invalidating path, we know that max{7ri,g} < max{7r 2 ,<7} 
holds. Since 7r 2 < max{7Ti,7r 2 } always holds, we obtain that max{7Ti,<7} < 
max{7T3,(7} must hold. This in turn implies that t\\ < 773 holds. Similarly, since 
tti < max{7Ti, 7T 2 }, we know that ni < 7r 3 must hold. Combining these last two 
inequalities yields max{7ri,7Ti} < 7r 3 . Since 7r 3 is a path in T and the inequality 
implies that it is not strongly locally correct, we again have a contradiction. 
(Note that with max{7Ti,7r 2 } < 773, we can at best derive max{7Ti, 7Tj} < 773 
which is not strong enough to contradict the invariant on T.) 

Case (c-2). In the second subcase, we assume that the endpoint of 7r 2 was 
chosen as parent based on the preference order: the maximum values are equal, 
thus max{7Ti,7r 2 } = 7r 3 holds. We now subdivide 7T; into two parts, 7Ti a and 
■^ib- ^ia runs from the first vertex of n\ up to the first vertex along 7Tj that 
is an ancestor in T' of candidate parent of g that is used by 7Tj. At the same 
point, we also split path 773 into 7T3 a and 7T3b- We now obtain two more cases, 
7r3 a < max{7Ti, 7Tj a } and 7T3 a > max{7Ti, 7Ti a }. In the former case, we obtain that 
max{7T3 a , 7Tjb} < max{7Ti,7r 2 } holds and thus (7T3 a , 7T;b, g) is also an invalidating 
path. Since this path starts at the NCA of 7r 2 and 7r 3 , this is already covered 
by case (b). In the latter case, we have that either path 7r 3a — which is in T — is 
not strongly locally correct (contradicting the induction hypothesis) or there is 
equality between the two paths (7Ti, 7Ti a ) and 7T3 a . In case of equality, we observe 




(a) (b) (c-1) (c-2) 

Fig. 10. The four cases for the proof of Lemma [5] 
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'• (a) 



* (b) 



Fig. 11. (a) A dead path 7Td and the corresponding path requiring extensions 7r e . 
(b) Endpoint of 7r has outdegree 1. None of its descendant has a shortcut to 7r e (l). 



that (7ri,7Tj a ) and 7T3 a arrive at their endpoint in the same order as 7T2 and 7r 3 
arrive at g. Thus T does not adhere to the preference order to break ties. This 
contradicts the invariant. □ 

Execution time. When a dead path tt^ is removed, we may need to extend a 
list of incoming shortcuts at 7Td(l), the node that remains in T. Let k denote 
the number of nodes in ir^- The lemma below relates the number of extended 
shortcuts to the size of ir^. The main observation is that the path requiring 
extensions starts at 77^(1) and ends at either G[i — l,j] or G[i,j — 1], since G[i,j] 
has not yet received any shortcuts. 

Lemma 6. A dead path tt^ with k nodes can result in at most 2-k—l extensions. 

Proof. Since 7td is a path with k nodes, it spans at most k columns and k rows. 
When a dead path is removed, its endpoint is G[i— 1, j— 1], Let ir e denote the path 
of T that requires extensions. We know that both paths start at the same node: 
7Td(l) = 7r c (l). The endpoint of ir c is at either G[i — l,j] or G[i,j — l], since G[i, j] 
has not yet received shortcuts when the dead path is removed. Also, note that if 
the endpoint of tt c is not the parent of G[i,j] and has outdegree higher than 0, 
then it is a growth node and its descendants are also growth nodes. Hence, these 
descendants have parent that is a growth node and thus have shortcuts that 



need to be extended. Fig. 11 illustrates these situations. Hence, we know that 7r e 
spans either k + 1 columns and k rows or vice versa. Therefore, the maximum 
number of nodes in 7r c is 2 • k, since it must be monotonous. Since 7T e (l) does 
not have a shortcut to itself, there are at most 2-k—l incoming shortcuts from 

7T e at 7Td(l). □ 

Hence, we can charge every extension to one of the ft — 1 dead nodes (all but 
7Td(l)). A node gets at most 3 charges, since it is a (non-first) node of a dead path 
at most once. Because an extension can be done in constant time, the execution 
time of the algorithm is 0(mn). Note that shortcuts that originate from a living 
node with outdegree 1 could be removed instead of extended. We summarize the 
findings of this section in the following theorem. 

Theorem 3. Algorithm^ computes a locally correct discrete Frechet matching 
of two polygonal curves P and Q with m and n edges in 0{mn) time. 
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6 Conclusion 

We set out to find "good" matchings between two curves. To this end we in- 
troduced the local correctness criterion for Frechet matchings. We have proven 
that there always exists at least one locally correct Frechet matching between any 
two polygonal curves. This proof resulted in an 0(N 3 log N) algorithm, where 
N is the total number of edges in the two curves. Furthermore, we considered 
computing a locally correct matching using the discrete Frechet distance. By 
maintaining a tree with shortcuts to encode locally correct partial matchings, 
we have shown how to compute such a matching in 0(N 2 ) time. 

Future work. Computing a locally correct discrete Frechet matching takes 0(N 2 ) 
time, just like the dynamic program to compute only the discrete Frechet dis- 
tance. However, computing a locally correct continuous Frechet matching takes 
0(N 3 log N) time, a linear factor more than computing the Frechet distance. An 
interesting question is whether this gap in computation can be reduced as well. 

Furthermore, it would be interesting to investigate the benefit of local correct- 
ness for other matching-based similarity measures, such as the geodesic width [5] . 
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